Oxidativley stable alpha-amylase

ABSTRACT

Novel alpha-amylase mutants derived from the DNA sequences of naturally occurring or recombinant alpha-amylases are disclosed. The mutant alpha-amylases, in general, are obtained by in vitro modifications of a precursor DNA sequence encoding the naturally occurring or recombinant alpha-amylase to generate the substitution (replacement) or deletion of one or more oxidizable amino acid residues in the amino acid sequence of a precursor alpha-amylase. Such mutant alpha-amylases have altered oxidative stability and/or altered pH performance profiles and/or altered thermal stability as compared to the precursor. Also disclosed are detergent and starch liquefaction compositions comprising the mutant amylases, as well as methods of using the mutant amylases.

RELATED APPLICATIONS

This is a divisional of U.S. Ser. No. 08/194,664 filed Feb. 10, 1994, now pending which is a continuation-in-part of U.S. Ser. No. 08/016,395 filed Feb. 11, 1993 now abandoned.

FIELD OF INVENTION

The present invention relates to novel alpha-amylase mutants having an amino acid sequence not found in nature, such mutants having an amino acid sequence wherein one or more amino acid residue(s) of a precursor alpha-amylase, specifically any oxidizable amino acid, have been substituted with a different amino acid. The mutant enzymes of the present invention exhibit altered stability/activity profiles including but not limited to altered oxidative stability, altered pH performance profile, altered specific activity and/or altered thermostability.

BACKGROUND OF THE INVENTION

Alpha-amylases (alpha-1,4-glucan-4-glucanohydrolase, EC3.2.1.1) hydrolyze internal alpha-1,4-glucosidic linkages in starch largely at random, to produce smaller molecular weight malto-dextrins. Alpha-amylases are of considerable commercial value, being used in the initial stages (liquefaction) of starch processing; in alcohol production; as cleaning agents in detergent matrices; and in the textile industry for starch desizing. Alpha-amylases are produced by a wide variety of microorganisms including Bacillus and Aspergillus, with most commercial amylases being produced from bacterial sources such as B. licheniformis, B. amyloliquefaciens, B. subtilis, or B. stearothermophilus. In recent years the preferred enzymes in commercial use have been those from B. lichenifonnis because of their heat stability and performance, at least at neutral and mildly alkaline pH's.

Previously there have been studies using recombinant DNA techniques to explore which residues are important for the catalytic activity of amylases and/or to explore the effect of modifying certain amino acids within the active site of various analyses (Vihinen, M. et al. (1990) J. Bichem. 107:267-272; Holm, L. et al. (1990) Protein Engineering 3:181-191; Takase, K. et al. (1992) Biochemica et Biophysica Acta, 1120:281-288; Matsui, I. et al. (1992) Febs Letters Vol. 310, No. 3, pp. 216-218); which residues are important for thermal stability (Suzuki, Y. et al. (1989) J. Biol. Chem. 264:18933-18938); and one group has used such methods to introduce mutations at various histidine residues in a B. licheniformis amylase, the rationale for making substitutions at histidine residues was that B. licheniformis amylase (known to be thermostable) when compared to other similar Bacillus amylases, has an excess of histidines and, therefore, it was suggested that replacing a histidine could affect the thermostability of the enzyme (Declerck, N. et al. (1990) J. Biol. Chem. 265:15481-15488; FR 2 665 178-A1; Joyet, P. et al. (1992) Bio/Technology 10:1579-1583).

It has been found that alpha-amylase is inactivated by hydrogen peroxide and other oxidants at pH's between 4 and 10.5 as described in the examples herein. Commercially, alpha-amylase enzymes can be used under dramatically different conditions such as both high and low pH conditions, depending on the commercial application. For example, alpha-amylases may be used in the liquefaction of starch, a process preferably performed at a low pH (pH<5.5). On the other hand, amylases may be used in commercial dish care or laundry detergents, which often contain oxidants such as bleach or peracids, and which are used in much more alkaline conditions.

In order to alter the stability or activity profile of amylase enzymes under varying conditions, it has been found that selective replacement, substitution or deletion of oxidizable amino acids, such as a methionine, tryptophan, tyrosine, histidine or cysteine, results in an altered profile of the variant enzyme as compared to its precursor. Because currently commercially available amylases are not acceptable (stable) under various conditions, there is a need for an amylase having an altered stability and/or activity profile. This altered stability (oxidative, thermal or pH performance profile) can be achieved while maintaining adequate enzymatic activity, as compared to the wild-type or precursor enzyme. The characteristic affected by introducing such mutations may be a change in oxidative stability while maintaining thermal stability or vice versa. Additionally, the substitution of different amino acids for an oxidizable amino acids in the alpha-amylase precursor sequence or the deletion of one or more oxidizable amino acid(s) may result in altered enzymatic activity at a pH other than that which is considered optimal for the precursor alpha-amylase. In other words, the mutant enzymes of the present invention may also have altered pH performance profiles, which may be due to the enhanced oxidative stability of the enzyme.

SUMMARY OF THE INVENTION

The present invention relates to novel alpha-amylase mutants that are the expression product of a mutated DNA sequence encoding an alpha-amylase, the mutated DNA sequence being derived from a precursor alpha-amylase by the deletion or substitution (replacement) of one or more oxidizable amino acid. In one preferred embodiment of the present invention the mutant result from substituting a different amino acid for one or more methionine residue(s) in the precursor alpha-amylase. In another embodiment of the present invention the mutants comprise a substitution of one or more tryptophan residue alone or in combination with the substitution of one or more methionine residue in the precursor alpha-amylase. Such mutant alpha-amylases, in general, are obtained by in vitro modification of a precursor DNA sequence encoding a naturally occurring or recombinant alpha-amylase to encode the substitution or deletion of one or more amino acid residues in a precursor amino acid sequence.

Preferably the substitution or deletion of one or more amino acids in the amino acid sequence is due to the replacement or deletion of one or more methionine, tryptophan, cysteine, histidine or tyrosine residues in such sequence, most preferably the residue which is changed is a methionine residue. The oxidizable amino acid residues may be replaced by any of the other 20 naturally occurring amino acids. If the desired effect is to alter the oxidative stability of the precursor, the amino acid residue may be substituted with a non-oxidizable amino acid (such as alanine, arginine, asparagine, aspartic acid, glutamic acid, glutamine, glycine, isoleucine, leucine, lysine, phenylalanine, proline, serine, threonine, or valine) or another oxidizable amino acid (such as cysteine, methionine, tryptophan, tyrosine or histidine, listed in order of most easily oxidizable to less readily oxidizable). Likewise, if the desired effect is to alter thermostability, any of the other 20 naturally occurring amino acids may be substituted (i.e., cysteine may be substituted for methionine).

Preferred mutants comprise the substitution of a methionine residue equivalent to any of the methionine residues found in B. lichenifornis alpha-amylase (+8, +15, +197, +256, +304, +366 and +438). Most preferably the methionine to be replaced is a methionine at a position equivalent to position +197 or +15 in B. licheniformis alpha-amylase. Preferred substitute amino acids to replace the methionine at position +197 are alanine (A), isoleucine (I), threonine (T) or cysteine (C). The preferred substitute amino acids at position +15 are leucine (L), threonine(T), asparagine (N), aspartate (D), serine (S), valine (V) and isoleucine (I), although other substitute amino acids not specified above may be useful. Two specifically preferred mutants of the present invention are M197T and M15L.

Another embodiment of this invention relates to mutants comprising the substitution of a tryptophan residue equivalent to any of the tryptophan residues found in B. licheniformis alpha-amylase (see FIG. 2). Preferably the tryptophan to be replaced is at a position equivalent to +138 in B. licheniformis alpha-amylase. A mutation (substitution) at a tryptophan residue may be made alone or in combination with mutations at other oxidizable amino acid residues. Specifically, it may be advantageous to modify by substitution at least one tryptophan in combination with at least one methionine (for example, the double mutant +138/+197).

The alpha-amylase mutants of the present invention, in general, exhibit altered oxidative stability in the presence of hydrogen peroxide and other oxidants such as bleach or peracids, or, more specific, milder oxidants such as chloramine-T. Mutant enzymes having enhanced oxidative stability will be useful in extending the shelf life and bleach, perborate, percarbonate or peracid compatibility of amylases used in cleaning products. Similarly, reduced oxidative stability may be useful in industrial processes that require the rapid and efficient quenching of enzymatic activity. The mutant enzymes of the present invention may also demonstrate a broadened pH performance profile whereby mutants such as M15L show stability for low pH starch liquefaction and mutants such as M197T show stability at high pH cleaning product conditions. The mutants of the present invention may also have altered thermal stability whereby the mutant may have enhanced stability at either high or low temperatures. It is understood that any change (increase or decrease) in the mutant's enzymatic characteristic(s), as compared to its precursor, may be beneficial depending on the desired end use of the mutant alpha-amylase.

In addition to starch processing and cleaning applications, variant amylases of the present invention may be used in any application in which known amylases are used, for example, variant amylases can be used in textile processing, food processing, etc. Specifically, it is contemplated that a variant enzyme such as M197C, which is easily inactivated by oxidation, would be useful in a process where it is desirable to completely remove amylase activity at the end of the process, for example, in frozen food processing applications.

The preferred alpha-amylase mutants of the present invention are derived from a Bacillus strain such as B. licheniformis, B. amyloliquefaciens, and B. stearothennophilus, and most preferably from Bacillus lichenifornis.

In another aspect of the present invention there is provided a novel form of the alpha-amylase normally produced by B. licheniformis. This novel form, designated as the A4 form, has an additional four alanine residues at the N-terminus of the secreted amylase. (FIG. 4b.) Derivatives or mutants of the A4 form of alpha-amylase are encompassed within the present invention. By derivatives or mutants of the A4 form, it is meant that the present invention comprises the A4 form alpha-amylase containing one or more additional mutations such as, for example, mutation (substitution, replacement or deletion) of one or more oxidizable amino acid(s).

In a composition embodiment of the present invention there are provided detergent compositions, liquid, gel or granular, comprising the alpha-amylase mutants described herein. Particularly preferred are detergent compositions comprising a +197 position mutant either alone or in combination with other enzymes such as endoglycosidases, celluloses, proteases, lipases or other amylase enzymes. Additionally, it is contemplated that the compositions of the present invention may include an alpha-amylase mutant having more than one site-specific mutation.

In yet another composition embodiment of the present invention there are provided compositions useful in starch processing and particularly starch liquefaction. The starch liquefaction compositions of the present invention preferably comprise an alpha-amylase mutant having a substitution or deletion at position M15. Additionally, it is contemplated that such compositions may comprise additional components as known to those skilled in the art, including, for example, antioxidants, calcium, ions, etc.

In a process aspect of the present invention there are provided methods for liquefying starch, and particularly granular starch slurries, from either a wet or dry milled process. Generally, in the first step of the starch degradation process, the starch slurry is gelatinized by heating at a relatively high temperature (up to about 110° C.). After the starch slurry is gelatinized it is liquefied and dextrinized using an alpha-amylase. The conditions for such liquefaction are described in commonly assigned U.S. patent application Ser. Nos. 07/785,624 and 07/785,623 and U.S. Pat. No. 5,180,669, the disclosure of which are incorporated herein by reference. The present method for liquefying starch comprises adding to a starch slurry an effective amount of an alpha-amylase of the present invention, alone or in combination with additional excipients such as an antioxidant, and reacting the slurry for an appropriate time and temperature to liquefy the starch.

A further aspect of the present invention comprises the DNA encoding the mutant alpha-amylases of the present invention (including A4 form and mutants thereof) and expression vectors encoding the DNA as well as host cells transformed with such expression vectors.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the DNA sequence of the gene for alpha-amylase from B. lichenformis (NCIB8061), Seq ID No 31, and deduced translation product as described in Gray, G. et al. (1986) J. Bacter. 166:635-643.

FIG. 2 shows the amino acid sequence of the mature alpha-amylases enzyme from B. licheniformis (NCIB8061), Seq ID No 32.

FIG. 3 shows an alignment of primary structures of Bacillus alpha-amylases. The B. licheniformis amylase (Am-Lich), Seq ID No 33, is described by Gray, G. et al. (1986) J. Bact. 166:635-643; the B. amyloliquefaciens amylase (Am-Amylo), Seq ID No 34, is described by Takkinen, K. et al. (1983) J. Biol. Chem. 258:1007-1013; and the B. stearothermophilus (Am-Stearo), Seq ID No 35, is described by Ihara, H. et al. (1985) J. Biochem. 98:95103.

FIG. 4a shows the amino acid sequence of the mature alpha-amylase variant M197T, Seq ID No 36.

FIG. 4b shows the amino acid sequence of the A4 form of alpha-amylase from B. licheniformis NCIB8061, Seq ID No 37. Numbering is from the N-terminus, starting with the four additional alanines.

FIG. 5 shows plasmid pA4BL wherein BLAA refers to B. licheniformis alpha-amylase gene, Pstl to SstI; Amp^(R) refers to the ampicillin-resistant gene from pBR322; and CAT refers to the Chloramphenicol-resistant gene from pC194.

FIG. 6 shows the signal sequence-mature protein junctions for B. licheniformis (Seq ID No 38), B. subtilis (Seq ID No 39), B. licheniformis in pA4BL (Seq ID No 40) and B. licheniformis in pBLapr (Seq ID No 41).

FIG. 7a shows inactivation of certain alpha-amylases (Spezyme® AA20 and M197L (A4 form) with 0.88M H₂ O₂ at pH 5.0, 25° C.

FIG. 7b shows inactivation of certain alpha-amylases (Spezyme® AA20, M197T) with 0.88M H₂ O₂ at pH 10.0, 25° C.

FIG. 7c shows inactivation of certain alpha-amylases (Spezyme® AA20, MI5L) with 0.88M H₂ O₂ at pH 5.0, 25° C.

FIG. 8 shows a schematic for the production of M197X cassette mutants.

FIG. 9 shows expression of M197X variants.

FIG. 10 shows thermal stability of M197X variants at pH 5.0, 5mM CaCl₂ at 95° C. for 5 mins.

FIGS. 11a and 11b show inactivation of certain amylases in automatic dish care detergents.

FIG. 11a shows the stability of certain amylases in Cascades ™(a commercially available dish care product) at 65° C. in the presence or absence of starch. FIG. 11b shows the stability of certain amylases in Sunlight™(a commercially available dish care product) at 65° C. in the presence or absence of starch.

FIG. 12 shows a schematic for the production of M15X cassette mutants.

FIG. 13 shows expression of M15X variants.

FIG. 14 shows specific activity of M15X variants on soluble starch.

FIG. 15 shows heat stability of M15X variants at 90° C., pH 5.0, 5 mM CaCl₂, 5 mins.

FIG. 16 shows specific activity on starch and soluble substrate, and performance in jet liquefaction at pH 5.5, of M15 variants as a function of percent activity of B. licheniformis wild-type.

FIG. 17 shows the inactivation of B. ichenifonmis alpha-amylase (AA20 at 0.65 mg/ml) with chloramine-T at pH 8.0 as compared to variants M197A (1.7 mg/ml) and M197L (1.7 mg/ml).

FIG. 18 shows the inactivation of B. licheniformis alpha-amylase (AA20 at 0.22 mg/ml) with chloramine-T at pH 4.0 as compared to variants M197A (4.3 mg/ml) and M197L (0.53 mg/ml).

FIG. 19 shows the reaction of B. licheniformis alpha-amylase (AA20 at 0.75 mg/ml) with chloramine-T at pH 5.0 as compared to double variants M197T/W138F (0.64 mg/ml) and M197T/W138Y (0.60 mg/ml).

DETAILED DESCRIPTION OF THE INVENTION

It is believed that amylases used in starch liquefaction may be subject to some form of inactivation due to some activity present in the starch slurry (see commonly owned U.S. application Ser. Nos. 07/785,624 and 07/785,623 and U.S. Pat. No.5,180,669, issued Jan. 19, 1993, incorporated herein by reference). Furthermore, use of an amylase in the presence of oxidants, such as in bleach or peracid containing detergents, may result in partial or complete inactivation of the amylase. Therefore, the preset invention focuses on altering the oxidative sensitivity of amylases. The mutant enzymes of the present invention may also have an altered pH profile and/or altered thermal stability which may be due to the enhanced oxidative stability of the enzyme at low or high pH's.

Alpha-amylase as used herein includes naturally occurring amylases as well as recombinant amylases. Preferred amylases in the present invention are alpha-amylases derived from B. licheniformis or B. stearothermophilus, including the A4 form of alpha-amylase derived from B. licheniformis as described herein, as well as fungal alpha-amylases such as those derived from Aspergillus (i.e., A. oryzae and A. niger).

Recombinant alpha-amylases refers to an alpha-amylase in which the DNA sequence encoding the naturally occurring alpha-amylase is modified to produce a mutant DNA sequence which encodes the substitution, insertion or deletion of one or more amino acids in the alpha-amylase sequence. Suitable modification methods are disclosed herein, and also in commonly owned U.S. Pat. Nos. 4,760,025 and 5,185,258, the disclosure of which are incorporated herein by reference.

Homologies have been found between almost all endo-amylases sequenced to date, ranging from plants, mammals, and bacteria (Nakajima, R. T. et al. (1986) Appl. Microbiol. Biotechnol. 23:355-360; Rogers, J. C. (1985) Biochem. Biophys. Res. Commun. 128:470-476). There are four areas of particularly high homology in certain Bacillus amylases, as shown in FIG. 3, wherein the underlined sections designate the areas of high homology. Further, sequence alignments have been used to map the relationship between Bacillus endo-amylases (Feng, D. F. and Doolittle, R. F. (1987) J. Molec. Evol. 35:351-360). The relative sequence homology between B. stearothermophilus and B. licheniformis amylase is about 66%, as determined by Holm, L. et al. (1990) Protein engineering (3) pp. 181-191. The sequence homology between B. licheniformis and B. amyloliquefaciens amylases is about 81%, as per Holm, L. et al., supra. While sequence homology is important, it is generally recognized that structural homology is also important in comparing amylases or other enzymes. For example, structural homology between fungal amylases and bacterial (Bacillus) amylase have been suggested and, therefore, fungal amylases are encompassed within the present invention.

An alpha-amylase mutant has an amino acid sequence which is derived from the amino acid sequence of a precursor alpha-amylase. The precursor alpha-amylases include naturally occurring alpha-amylases and recombinant alpha-amylases (as defined). The amino acid sequence of the alpha-amylase mutant is derived from the precursor alpha-amylase amino acid sequence by the substitution, deletion or insertion of one or more amino acids of the precursor amino acid sequence. Such modification is of the precursor DNA sequence which encodes the amino acid sequence of the precursor alpha-amylase rather than manipulation of the precursor alpha-amylase enzyme per se. Suitable methods for such manipulation of the precursor DNA sequence include methods disclosed herein and in commonly owned U.S. Pat. Nos. 4,760,025 and 5,185,258.

Specific residues corresponding to positions M197, M15 and W138 of Bacillus licheniformis alpha-amylase are identified herein for substitution or deletion, as are all methionine, histidine, tryptophan, cysteine and tyrosine positions. The amino acid position number (i.e., +197) refers to the number assigned to the mature Bacillus lichenifonnis alpha-amylase sequence presented in FIG. 2. The invention, however, is not limited to the mutation of this particular mature alpha-amylase (B. licheniformis) but extends to precursor alpha-amylases containing amino acid residues at positions which are equivalent to the particular identified due in B. licheniformis alpha-amylase. A residue (amino acid) of a precursor alpha-amylase is equivalent to a residue of B. licheniformis alpha-amylase if it is either homologous (i.e., corresponding in position in either primary or tertiary structure) or analogous to a specific residue or portion of that residue in B. licheniformis alpha-amylase (i.e., having the same or similar functional capacity to combine, react, or interact chemically or structurally).

In order to establish homology to primary structure, the amino acid sequence of a precursor alpha-amylase is directly compared to the B. licheniformis alpha-amylase primary sequence and particularly to a set of residues known to be invariant to all alpha-amylases for which sequence is known, as seen in FIG. 3. It is possible also to determine equivalent residues by tertiary structure: crystal structures have been reported for porcine pancreatic alpha-amylase (Buisson, G. et al. (1987) EMBO J.6:3909-3916); Taka-amylase A from Aspergillus oryzae (Matsuura, Y. et al. (1984) J. Biochem. (Tokyo) 95:697-702); and an acid alpha-amylase from A. niger (Boel, E. et al. (1990) Biochemistry 29:6244-6249), with the former two structures being similar. There are no published structures for Bacillus alpha-amylases, although there are predicted to be common super-secondary structures between glucanases (MacGregor, E. A. & Svensson, B. (1989) Biochem. J. 259:145-152) and a structure for the B. stearothermophilus enzyme has been modeled on that of Taka-amylase A (Holm, L. et al. (1990) Protein Engineering 3:181-191). The four highly conserved regions shown in FIG. 3 contain many residues thought to be part of the active-site (Matsuura, Y. et al. (1984) J. Biochem. (Tokyo) 95:697-702; Buisson, G. et al. (1987) EMBO J. 6:3909-3916; Vihinen, M. et al. (1990) J. Biochem. 107:267-272) including, in the licheniformis numbering, His105; Arg229; Asp231; His235; Glu261 and Asp328.

Expression vector as used herein refers to a DNA construction containing a DNA sequence which is operably linked to a suitable control sequence capable of effecting the expression of said DNA in a suitable host. Such control sequences may include a promoter to effect transcription, an optional operator sequence to control such transcription, a sequence encoding suitable mRNA ribosome-binding sites, and sequences which control termination of transcription and translation. A preferred promoter is the B. subtilis aprE promoter. The vector may be a plasmid, a phage particle, or simply a potential genomic insert. Once transformed into a suitable host, the vector may replicate and function independently of the host genome, or may, in some instances, integrate into the genome itself. In the present specification, plasmid and vector are sometimes used interchangeably as the plasmid is the most commonly used form of vector at present. However, the invention is intended to include such other forms of expression vectors which serve equivalent functions and which are, or become, known in the art.

Host strains (or cells) useful in the present invention generally are procaryotic or eucaryotic hosts and include any transformable microorganism in which the expression of alpha-amylase can be achieved. Specifically, host strains of the same species or genus from which the alpha-amylase is derived are suitable, such as a Bacillus strain. Preferably an alpha-amylase negative Bacillus strain (genes deleted) and/or an alpha-amylase and protease deleted Bacillus strain such as Bacillus subtilis strain BG2473 (ΔamyE, Δapr, Δnpr) is used. Host cells are transformed or transfected with vectors constructed using recombinant DNA techniques. Such transformed host cells are capable of either replicating vectors encoding the alpha-amylase and its variants (mutants) or expressing the desired alpha-amylase.

Preferably the mutants the present invention are secreted into the culture medium during fermentation. Any suitable signal sequence, such as the aprE signal peptide, can be used to achieve secretion.

Many of the alpha-amylase mutants of the present invention are useful in formulating various detergent compositions, particularly certain dish care cleaning compositions, especially those cleaning compositions containing known oxidants. Alpha-amylase mutants of the invention can be formulated into known powdered, liquid or gel detergents having pH between 6.5 to 12.0. Suitable granular composition may be made as described in commonly owned U.S. patent application Ser. Nos. 07/429,881, 07/533,721 and 071957,973, all of which are incorporated herein by reference. These detergent cleaning compositions can also contain other enzymes, such as known proteases, lipases, celluloses, endoglycosidases or other amylases, as well as builders, stabilizers or other excipients known to those skilled in the art. These enzymes can be present as co-granules or as blended mixes or in any other manner known to those skilled in the art. Furthermore, it is contemplated by the present invention that multiple mutants may be useful in cleaning or other applications. For example, a mutant enzyme having changes at both +15 and +197 may exhibit enhanced performance useful in a cleaning product or a multiple mutant comprising changes at +197 and +138 may have improved performance.

As described previously, alpha-amylase mutants of the present invention may also be useful in the liquefaction of starch. Starch liquefaction, particularly granular starch slurry liquefaction, is typically carried out at near neutral pH's and high temperatures. As described in commonly owned U.S. application Ser. Nos. 07/788,624 and 071785,623 and U.S. Pat. No. 5,180,669, it appears that an oxidizing agent or inactivating agent of some sort is also present in typical liquefaction processes, which may affect the enzyme activity; thus, in these related patent applications an antioxidant is added to the process to protect the enzyme.

Based on the conditions of a preferred liquefaction process, as described in commonly owned U.S. application Ser. Nos. 071788,624 and 071785,623 and U.S. Pat. No. 5,180,669, namely low pH, high temperature and potential oxidation conditions, preferred mutants of the present invention for use in liquefaction processes comprise mutants exhibiting altered pH performance profiles (i.e., low pH profile, pH <6 and preferably pH <5.5), and/or altered thermal stability (i.e., high temperature, about 900°-110° C.), and/or altered oxidative stability (i.e., enhanced oxidative stability).

Thus, an improved method for liquefying starch is taught by the present invention, the method comprising liquefying a granular starch slurry from either a wet or dry milling process at a pH from about 4 to 6 by adding an effective amount of an alpha-amylase mutant of the present invention to the starch slurry; optionally adding an effective amount of an antioxidant or other excipient to the slurry; and reacting the slurry for an appropriate time and temperature to liquefy the starch.

The following is presented by way of example and is not to be construed as a limitation to the scope of the claims. Abbreviations used herein, particularly three letter or one letter notations for amino acids are described in Dale, J. W., Molecular Genetics of Bacteria, John Wiley & Sons, (1989) Appendix B.

Experimental

EXAMPLE 1 Substitutions for the Methionine Residues in B. licheniformis Alpha-Amylase

The alpha-amylase gene (FIG. 1) was cloned from B. licheniformis NCIB8061 obtained from the National Collection of Industrial Bacteria, Aberdeen, Scotland (Gray, G. et al. (1986) J. Bacteriology 166:635-643). The 1.72kb Pstl-Sstl fragment, encoding the last three residues of the signal sequence; the entire mature protein and the terminator region was subcloned into M13MP18. A synthetic terminator was added between the Bcll and Sstl sites using a synthetic oligonucleotide cassette of the form:

    __________________________________________________________________________     BclI                                      SstI                                 __________________________________________________________________________     5' GATCAAAACATAAAAAACCGGCCTTGGCCCCGCCGGTTTTTTATTATTTTTGAGCT                                                              3'                                                                               Seq ID No 1                        3' TTTTGTATTTTTTGGCCGGAACCGGGGCGGCCAAAAAATAATAAAAAC                                                                      5'                                   __________________________________________________________________________

designed to contain the B. amyloliquefaciens subtilisin transcriptional terminator (Wells et al. (1983) Nucleic Acid Research 11:7911-7925).

Site-directed mutagenesis by oligonucleotides used essentially the protocol of Zoller, M. et al. (1983) Meth. Enzymol. 100:468-500: briefly, 5'-phosphorylated oligonucleotide primers were used to introduce the desired mutations on the M13 single-stranded DNA template using the oligonucleotides listed in Table I to substitute for each of the seven methionines found in B. licheniformis alpha-amylase. Each mutagenic oligonucleotide also introduced a restriction endonuclease site to use as a screen for the linked mutation.

                  TABLE 1                                                          ______________________________________                                         Mutagenic Oligonucleotides for the Substitution of the                         Methionine Residues in B. licheniformis Alpha-Amylase                           ##STR1##                  Seq ID No 2                                          ##STR2##                  Seq ID No 3                                          ##STR3##                  Seq ID No 4                                          ##STR4##                  Seq ID No 5                                          ##STR5##                  Seq ID No 6                                          ##STR6##                  Seq ID No 7                                          ##STR7##                  Seq ID No 8                                          ##STR8##                  Seq ID No 9                                         ______________________________________                                          Bold letter indicate base changes introduced by oligonucleotide.   -           Codon changes indicated in teh form M8A, where methionine (M) at position      + 8 has been changed to alanine (A).   -                                       Underlining indicates restriction endonuclease site introduced by              oligonucleotide                                                          

The heteroduplex was used to transfect E. coli mutL cells (Kramer et al. (1984) Cell 38:879) and, after plaque-purification, clones were analyzed by restriction analysis of the RF1's. Positives were confirmed by dideoxy sequencing (Sanger et al. (1977) Proc. Natl. Acad. Sci. U.S.A. 74:5463-5467) and the Pstl-Sstl fragments for each subcloned into an E. coli vector, plasmid pA4BL.

Plasmid pA4BL

Following the methods described in U.S. application Ser. No. 860,468 (Power et al.), which is incorporated herein by reference, a silent Pstl site was introduced at codon +1 (the first amino-acid following the signal cleavage site) of the aprE gene from pS168-1 (Stahl, M. L. and Ferrari, E. (1984) J. Bacter. 158:411-418). The sprE promoter and signal peptide region was then cloned out of a pJH101 plasmid (Ferrari, F. A. et al. (1983) J. Bacter. 154:1513-1515) as a Hindlll-Pstl fragment and subcloned into the pUC18-derived plasmid JM102 (Ferrari, E. and Hoch, J. A. (1989) Bacillus, ed. C. R. Harwood, Plenum Pub., pp. 57-72). Addition of the Pstl-Sstl fragment from B. licheniformis alpha-amylase gave pA4BL (FIG. 5) having the resulting aprE signal peptide-amylase junction as shown in FIG. 6.

Transformation Into B. subtilis

pA4BL is a plasmid able to replicate in E. coli and integrate into the B. subtilis chromosome. Plasmids containing different variants were transformed into B. subtilis (Anagnostopoulos, C. and Spizizen, J. (1961) J. Bacter. 81:741-746) and integrated into the chromosome at the aprE locus by a Campbell-type mechanism (Young, M. (1984) J. Gen. Microbiol. 130:1613-1621). The Bacillus subtilis strain BG2473 was a derivative of l168 which had been deleted for amylase (ΔamyE) and two proteases (Δapr, Δnpr) (Stahl, M. L. and Ferrari, E., J. Bacter. 158:411-418 and U.S. Pat. Ser. No. 5,264,366, incorporated herein by reference). After transformation the sacU32(Hy) (Henner, D. J. et al. (1988) J. Bacter. 170:296-300) mutation was introduced by PBS-1 mediated transduction (Hoch, J. A. (1983) 154:1513-1515).

N-terminal analysis of the amylase expressed from pA4BL in B. subtilis showed it to be processed having four extra alanines at the N-terminus of the secreted amylase protein ("A4 form"). These extra residues had no significant, deleterious effect on the activity or thermal stability of the A4 form and in some applications may enhance performance. In subsequent experiments the correctly processed forms of the licheniformis amylase and the variant M197T were made from a very similar construction (see FIG. 6). Specifically, the 5' end of the A4 construction was subcloned on an EcoRI-Sstll fragment, from pA4BL (FIG. 5) into M13BM20 (Boehringer Mannheim) in order to obtain a coding-strand template for the mutagenic oligonucleotide below:

    __________________________________________________________________________     5'-CAT CAG CGT CCC ATT AAG ATT TGC AGC CTG CGC AGA CAT GTT GCT-3' Seq ID       No 10                                                                          __________________________________________________________________________

This primer eliminated the codons for the extra four N-terminal alanines, correct forms being screened for by the absence of the Pstl site. Subcloning the EcoRI-Sstll fragment back into the pA4BL vector (FIG. 5) gave plasmid pBLapr. The M197T substitution could then be moved, on a Sstil-Sstl fragment, out of pA4BL (M197T) into the complementary pBLapr vector to give plasmid pBLapr (M197T). N-terminal analysis of the amylase expressed from pBLapr in B. subtilis showed it to be processed with the same N-terminus found in B. licheniformis alpha-amylase.

EXAMPLE 2 Oxidative Sensitivity of Methionine Variants

B. licheniformis alpha-amylase, such as Spezyme® AA20 (commercially available from Genencor International, Inc.), is inactivated rapidly in the presence of hydrogen peroxide (FIG. 7). Various methionine variants were expressed in shake-flask cultures of B. subtilis and the crude supernatants purified by ammonium sulphate cuts. The amylase was precipitated from a 20% saturated ammonium sulphate supernatant by raising the ammonium sulphate to 70% saturated, and then resuspended. The variants were then exposed to 0.88M hydrogen peroxide at pH 5.0, at 25° C. Variants at six of the methionine positions in B. licheniformis alpha-amylase were still subject to oxidation by peroxide while the substitution at position +197 (M197L) showed resistance to peroxide oxidation. (See FIG. 7.) However, subsequent analysis described in further detail below showed that while a variant may be susceptible to oxidation at pH 5.0, 250° C., it may exhibit altered/enhanced properties under different conditions (i.e., liquefaction).

EXAMPLE 3 Construction of All Possible Variants at Position 197

All of the M197 variants (M197X) were produced in the A4 form by cassette mutagenesis, as outlined in FIG. 8:

1) Site directed mutagenesis (via primer extension in M13) was used to make M197A using the mutagenic oligonucleotide below: ##STR9## which also inserted an EcoRV site (codons 200-201) to replace the ClaI site (codons 201-202).

2) Then primer LAAM12 (Table II) was used to introduce another silent restriction site (BstBI) over codons 186-188.

3) The resultant M197A (BstBl+, EcoRV+) variant was then subcloned (Pstl-Sstl fragment) into plasmid pA4BL and the resultant plasmid digested with BstBI and EcoRV and the large vector-containing fragment isolated by electroelution from agarose gel.

4) Synthetic primers LAAM14-30 (Table II) were each annealed with the largely complementary common primer LAAM13 (Table II). The resulting cassettes encoded for all the remaining naturally occurring amino acids at position +197 and were ligated, individually, into the vector fragment prepared above.

                                      TABLE II                                     __________________________________________________________________________     Synthetic Oligonucleotides Used for Cassette Mutagenesis                       to Produce M197X Variants                                                      __________________________________________________________________________     LAAM12                                                                              GG GAA GTT TCG AAT GAA AAC G               Seq ID No 12                   LAAM13                                                                              X197bs                                     Seq ID No 13                        (EcoRV)                                                                             GTC GGC ATA TG CAT ATA ATC ATA GTT GCC GTT TTC ATT (BstBI)           LAAM14                                                                              I197                                       Seq ID No 14                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG ATC TAT GCC GAC (RcoRV-)      LAAM15                                                                              F197                                       Seq ID No 15                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG TTC TAT GCC GAC (RcoRV-)      LAAM16                                                                              V197                                       Seq ID No 16                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG GTT TAT GCC GAC (RcoRV-)      LAAM17                                                                              S197                                       Seq ID No 17                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG AGC TAT GCC GAC (RcoRV-)      LAAM18                                                                              P197                                       Seq ID No 18                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG CCT TAT GCC GAC (RcoRV-)      LAAM19                                                                              T197                                       Seq ID No 19                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG ACA TAT GCC GAC (RcoRV-)      LAAM20                                                                              Y197                                       Seq ID No 20                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG TAC TAT GCC GAC (RcoRV-)      LAAM21                                                                              H197                                       Seq ID No 21                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG CAC TAT GCC GAC (RcoRV-)      LAAM22                                                                              G197                                       Seq ID No 22                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG GGC TAT GCC GAC (RcoRV-)      LAAM23                                                                              Q197                                       Seq ID No 23                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG CAA TAT GCC GAC (RcoRV-)      LAAM24                                                                              N197                                       Seq ID No 24                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG AAC TAT GCC GAC (RcoRV-)      LAAM25                                                                              K197                                       Seq ID No 25                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG AAA TAT GCC GAC (RcoRV-)      LAAM26                                                                              D197                                       Seq ID No 26                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG GAT TAT GCC GAC (RcoRV-)      LAAM27                                                                              E197                                       Seq ID No 27                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG GAA TAT GCC GAC (RcoRV-)      LAAM28                                                                              C197                                       Seq ID No 28                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG TGT TAT GCC GAC (RcoRV-)      LAAM29                                                                              W197                                       Seq ID No 29                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG TGG TAT GCC GAC (RcoRV-)      LAAM30                                                                              R197                                       Seq ID No 30                        (BstBI)                                                                             CG AAT GAA AAC GGC AAC TAT GAT TAT TTG AGA TAT GCC                   __________________________________________________________________________               GAC (RcoRV-)                                                    

The cassettes were designed to destroy the EcoRV site upon ligation, thus plasmids from E. coli transformants were screened for loss of this unique site. In addition, the common bottom strand of the cassette contained a frame-shift and encoded a Nsil site, thus transformants derived from this strand could be eliminated by screening for the presence of the unique Nsil site and would not be expected, in any case, to lead to expression of active amylase.

Positives by restriction analysis were confirmed by sequencing and transformed in B. subtilis for expression in shake-flask cultures (FIG. 9). The specific activity of certain of the M197X mutants was then determined using a soluble substrate assay. The data generated using the following assay methods are presented below in Table III.

Soluble Substrate Assay: A rate assay was developed based on an end-point assay kit supplied by Megazyme (Aust.) Pty. Ltd.: Each vial of substrate (p-nitrophenyl maltoheptaoside, BPNPG7) was dissolved in 10ml of sterile water, followed by a 1 to 4 dilution in assay buffer (5OmM maleate buffer, pH 6.7, 5mM calcium chloride, 0.002% Tween20). Assays were performed by adding 10 μl of amylase to 790 μl of the substrate in a cuvette at 25° C. Rates of hydrolysis were measured as the rate of change of absorbance at 410nm, after a delay of 75 seconds. The assay was linear up to rates of 0.4 absorption units/min.

The amylase protein concentration was measured using the standard Bio-Rad assay (Bio-Rad Laboratories) based on the method of Bradford, M. (1976) Anal. Biochem. 72:248) using bovine serum albumin standards.

Starch Hydrolysis Assay The standard method for assaying the alpha-amylase activity of Spezyme® AA20 was used. This method is described in detail in Example 1 of U.S. Ser. No. 071785,624, incorporated herein by reference. Native starch forms a blue color with iodine but fails to do so when it is hydrolyzed into shorter dextrin molecules. The substrate is soluble lintner starch 5gm/liter in phosphate buffer, pH 6.2 (42.5gm/liter potassium dihydrogen phosphate, 3.16gm/liter sodium hydroxide). The sample is added in 25mM calcium chloride and activity is measured as the time taken to give a negative iodine test upon incubation at 30° C. Activity is recorded in liquefons per gram or ml (LU) calculated according to the formula: ##EQU1##

                  TABLE III                                                        ______________________________________                                                        SPECIFIC ACTIVITY                                                              (as % of AA20 value) on:                                        ALPHA-AMYLASE    Soluble Substrate                                                                          Starch                                            ______________________________________                                         Spezyme ® AA20                                                                              100         100                                               A4 form          105         115                                               M15L (A4 form)   93          94                                                M15L             85          103                                               M197T (A4 form)  75          83                                                M197T            62          81                                                M197A (A4 form)  88          89                                                M197C            85          85                                                M197L (A4 form)  51          17                                                ______________________________________                                    

EXAMPLE 4 Characterization of Variant M15L

Variant Ml5L made as per the prior examples did not show increased amylase activity (Table III) and was still inactivated by hydrogen peroxide (FIG. 7). It did, however, show significantly increased performance in jet-liquefaction of starch especially at low pH as shown in Table IV below.

Starch liquefaction was typically performed using a Hydroheater M 103-M steam jet equipped with a 2.5 liter delay coil behind the mixing chamber and a terminal back pressure valve. Starch was fed to the jet by a Moyno pump and steam was supplied by a 150 psi steam line, reduced to 90-100 psi. Temperature probes were installed just after the Hydroheater jet and just before the back pressure valve.

Starch slurry was obtained from a corn wet miller and used within two days. The starch was diluted to the desired solids level with deionized water and the pH of the starch was adjusted with 2% NaOH or saturated Na₂ CO₃. Typical liquefaction conditions were:

Starch 32%-35% solids

    ______________________________________                                         Starch         32%-35%  solids                                                 Calcium        40-50    ppm (30 ppm added)                                     pH             5.0-6.0                                                         Alpha-amylase  12-14    LU/g starch dry basis                                  ______________________________________                                    

Starch was introduced into the jet at about 350 ml/min. The jet temperature was held at 105° -107° C. Samples of starch were transferred from the jet cooker to a 95° C. second stage liquefaction and held for 90 minutes.

The degree of starch liquefaction was measured immediately after the second stage liquefaction by determining the dextrose equivalence (DE) of the sample and by testing for the presence of raw starch, both according to the methods described in the Standard Analytical Methods of the Member Companies of the Corn Refiners Association. Inc., sixth edition. Starch, when treated generally under the conditions given above and at pH 6.0, will yield a liquefied starch with a DE of about 10 and with no raw starch. Results of starch liquefaction tests using mutants of the present invention are provided in Table IV.

                  TABLE IV                                                         ______________________________________                                         Performance of Variants M15L (A4 form) and                                     M15L in Starch Liquefaction                                                                   pH  DE after 90 Mins.                                           ______________________________________                                         Spezyme ® AA20                                                                              5.9   9.9                                                     M15L (A4 form)   5.9   10.4                                                    Spezyme ® AA20                                                                              5.2   1.2                                                     M15L (A4 form)   5.2   2.2                                                     Spezyme ® AA20                                                                              5.9   9.3*                                                    M15L             5.9   11.3*                                                   Spezyme ® AA20                                                                              5.5   3.25**                                                  M15L             5.5   6.7**                                                   Spezyme ® AA20                                                                              5.2   0.7**                                                   M15L             5.2   3.65**                                                  ______________________________________                                          *average of three experiments                                                  **average of two experiments                                             

EXAMPLE 5 Construction of M15X Variants

Following generally the processes described in Example 3 above, all variants at M15 (M15X) were produced in native B. licheniformis by cassette mutagenesis, as outlined in FIG. 12:

1) Site directed mutagenesis (via primer extension in M13) was used to introduce unique restriction sites flanking the M15 codon to facilitate insertion of a mutagenesis cassette. Specifically, a BstB1 site at codons 11-13 and a Msc1 site at codons 18-20 were introduced using the two oligonucleotides shown below. ##STR10## 2) The vector for M15X cassette mutagenesis was then constructed by subcloning the Sfi1-Sstll fragment from the mutagenized amylase (BstB1+, Msc1+) into plasmid pBLapr. The resulting plasmid was then digested with BstB1 and Msc1 and the large vector fragment isolated by electroelution from a polyacrylamide gel.

3) Mutagenesis cassettes were created as with the M197X variants. Synthetic oligomers, each encoding a substitution at codon 15, were annealed to a common bottom primer. Upon proper ligation of the cassette to the vector, the Msc1 is destroyed allowing for screening of positive transformants by loss of this site. The bottom primer contains an unique SnaB1 site allowing for the transformants derived from the bottom strand to be eliminated by screening for the SnaB1 site. This primer also contains a frameshift which would also eliminate amylase expression for the mutants derived from the common bottom strand.

The synthetic cassettes are listed in Table V and the general cassette mutagenesis strategy is illustrated in FIG. 12.

                                      TABLE V                                      __________________________________________________________________________     Synthetic Oligonucleotides Used for Cassette Mutagenesis                       to Produce M15X Variants                                                       __________________________________________________________________________     M15A (BstB1) C GAA TGG TAT GCT CCC AAT GAC GG (Msc1)                                                         Seq ID No 50                                     M15R (BstB1) C GAA TGG TAT CGC CCC AAT GAC GG (Msc1)                                                         Seq ID No 51                                     M15N (BstB1) C GAA TGG TAT AAT CCC AAT GAC GG (Msc1)                                                         Seq ID No 52                                     M15D (BstB1) C GAA TGG TAT GAT CCC AAT GAC GG (Msc1)                                                         Seq ID No 53                                     M15H (BstB1) C GAA TGG TAT CAC CCC AAT GAC GG (Msc1)                                                         Seq ID No 54                                     M15K (BstB1) C GAA TGG TAT AAA CCC AAT GAC GG (Msc1)                                                         Seq ID No 55                                     M15P (BstB1) C GAA TGG TAT CCG CCC AAT GAC GG (Msc1)                                                         Seq ID No 56                                     M15S (BstB1) C GAA TGG TAT TCT CCC AAT GAC GG (Msc1)                                                         Seq ID No 57                                     M15T (BstB1) C GAA TGG TAT ACT CCC AAT GAC GG (Msc1)                                                         Seq ID No 58                                     M15V (BstB1) C GAA TGG TAT GTT CCC AAT GAC GG (Msc1)                                                         Seq ID No 59                                     M15C (BstB1) C GAA TGG TAT TGT CCC AAT GAC GG (Msc1)                                                         Seq ID No 60                                     M15Q (BstB1) C GAA TGG TAT CAA CCC AAT GAC GG (Msc1)                                                         Seq ID No 61                                     M15E (BstB1) C GAA TGG TAT GAA CCC AAT GAC GG (Msc1)                                                         Seq ID No 62                                     M15G (BstB1) C GAA TGG TAT GGT CCC AAT GAC GG (Msc1)                                                         Seq ID No 63                                     M15I (BstB1) C GAA TGG TAT ATT CCC AAT GAC GG (Msc1)                                                         Seq ID No 64                                     M15F (BstB1) C GAA TGG TAT TTT CCC AAT GAC GG (Msc1)                                                         Seq ID No 65                                     M15W (BstB1) C GAA TGG TAT TGG CCC AAT GAC GG (Msc1)                                                         Seq ID No 66                                     M15Y (BstB1) C GAA TGG TAT TAT CCC AAT GAC GG (Msc1)                                                         Seq ID No 67                                     M15Z (Msc1) C GTC ATT GGG ACT ACG TAC CAT T (BstB1)                                                          Seq ID No 68                                     (bottom strand)                                                                __________________________________________________________________________      Underline indicates codon changes at amino acid position 15.                   Conservative substitutions were made in some cases to prevent introductio      of new restriction sites.                                                

EXAMPLE 6 Bench Liquefaction with M15X Variants

Eleven alpha-amylase variants with substitutions for M15 made as per Example 5 were assayed for activity, as compared to Spezyme® AA20 (commercially available from Genencor International, Inc.) in liquefaction at pH 5.5 using a bench liquefaction system. The bench scale liquefaction system consisted of a stainless steel coil (0.25 inch diameter, approximately 350 ml volume) equipped with a 7 inch long static mixing element approximately 12 inches from the anterior end and a 30 psi back pressure valve at the posterior end. The coil, except for each end, was immersed in a glycerol-water bath equipped with thermostatically controlled heating elements that maintained the bath at 105° -106° C.

Starch slurry containing enzyme, maintained in suspension by stirring, was introduced into the reaction coil by a piston driven metering pump at about 70 ml/min. The starch was recovered from the end of the coil and was transferred to the secondary hold (95° C. for 90 minutes). Immediately after the secondary hold, the DE of the liquefied starch was determined, as described in Example 4. The results are shown in FIG. 16.

EXAMPLE 7 Characterization of M197X Variants

As can be seen in FIG. 9, there was a wide range of amylase activity (measured in the soluble substrate assay) expressed by the M197X (A4 form) variants. The amylases were partially purified from the supernatants by precipitation with two volumes of ethanol and resuspension. They were then screened for thermal stability (FIG. 10) by heating at 95° C. for 5 minutes in 10 mM acetate buffer pH 5.0, in the presence of 5 mM calcium chloride; the A4 wild-type retained 28% of its activity after incubation. For M197W and M197P we were unable to recover active protein from the supernatants. Upon sequencing, the M197H variant was found to contain a second mutation, N190K. M1 97L was examined in a separate experiment and was one of the lowest thermally stable variants. There appears to be a broad correlation between expression of amylase activity and thermal stability. The licheniformis amylase is restricted in what residues it can accommodate at position 197 in terms of retaining or enhancing thermal stability: cysteine and threonine are preferred for maximal thermal stability under these conditions whereas alanine and isoleucine are of intermediate stability. However, other substitutions at position +197 result in lowered thermal stability which may be useful for other applications. Additionally, different substitutions at +197 may have other beneficial properties, such as altered pH performance profile or altered oxidative stability. For example, the M197C variant was found to inactivate readily by air oxidation but had enhanced thermal stability. Conversely, compared to the M197L variant, both M197T and M197A retained not only high thermal stability (FIG. 10), but also high activity (Table III), while maintaining resistance to inactivation by peroxide at pH 5 to pH 10 (FIG. 7).

EXAMPLE 8 Stability and Performance in Detergent Formulation

The stability of the M197T (A4 form), M197T and M197A (A4 form) was measured in automatic dish care detergent (ADD) matrices. 2ppm Savinase™(a protease, commercially available from Novo Industries, of the type commonly used in ADD) were added to two commercially available bleach-containing ADD's: Cascad™(Procter and Gamble, Ltd.) and Sunlight™(Unilever) and the time course of inactivation of the amylase variants and Termamyl™(a thermally stable alpha-amylase available from Novo Nordisk, A/S) followed at 65° C. The concentration of ADD product used in both cases was equivalent to `pre-soak` conditions: 14gm product per liter of water (7 grams per gallon hardness). As can be seen (FIGS. 11a and 11b), both form of the M197T variant were much more stable than Termamyl™ and M197A (A4 form), which were inactivated before the first assay could be performed. This stability benefit was seen in the presence or absence of starch as determined by the following protocol. Amylases were added to 5ml of ADD and Savinase™, prewarmed in a test tube and, after vortexing, activities were assayed as a function of time, using the soluble substrate assay. The "+starch" tube had spaghetti starch baked onto the sides (140° C., 60 mins.). The results are shown in FIGS. 11a and 11b.

EXAMPLE 9 Characterization of M15X Variants

All M15X variants were propagated in Bacillus subtilis and the expression level monitored as shown in FIG. 13. The amylase was isolated and partially purified by a 20-70% ammonium sulfate cut. The specific activity of these variants on the soluble substrate was determined as per Example 3 (FIG. 14). Many of the M15X amylases have specific activities greater than that of Spezyme® AA20. A benchtop heat stability assay was performed on the variants by heating the amylase at 90° C. for 5 min. in 50 mM acetate buffer pH 5 in the presence of 5 mM CaCI₂ (FIG. 15). Most of the variants performed as well as Spezyme® AA20 in this assay. Those variants that exhibited reasonable stability in this assay (reasonable stability defined as those that retained at least about 60% of Spezyme® AA20's heat stability) were tested for specific activity on starch and for liquefaction performance at pH 5.5. The most interesting of those mutants are shown in FIG. 16. M15D, N and T, along with L, out performed Spezyme® AA20 in liquefaction at pH 5.5 and have increased specific activities in both the soluble substrate and starch hydrolysis assays.

Generally, we have found that by substituting for the methionine at position 15, we can provide variants with increased low pH-liquefaction performance and/or increased specific activity.

EXAMPLE 10 Tryptophan Sensitivity to Oxidation

Chloramine-T (sodium N-chloro-p-toluenesulfonamide) is a selective oxidant, which oxidizes methionine to methionine sulfoxide at neutral or alkaline pH. At acidic pH, chloramine-T will modify both methionine and tryptophan (Schechter, Y., Burstein, Y. and Patchomik, A. (1975) Biochemistry 14 (20) 4497-4503). FIG. 17 shows the inactivation of B. licheniformis alpha-amylase with chloramine-T at pH 8.0 (AA20=0.65 mg/ml, M197A=1.7 mg/ml, M197L=1.7 mg/ml). The data shows that by changing the methionine at position 197 to leucine or alanine, the inactivation of alpha-amylase can be prevented. Conversely, as shown in FIG. 18, at pH 4.0 inactivation of the M197A and M197L proceeds, but require more equivalents of chloramine-T (FIG. 18; AA20=0.22 mg/ml, M197A=4.3 mg/ml, M197L=0.53 mg/ml; 200 mM NaAcetate at pH 4.0). This suggests that a tryptophan residue is also implicated in the chloramine-T mediated inactivation event. Furthermore, tryptic mapping and subsequent amino acid sequencing indicated that the tryptophan at position 138 was oxidized by chloramine-T (data not shown). To prove this, site-directed mutants were made at tryptophan 138 as provided below:

Preparation of Alpha-Amvlase Double Mutants W138 and M197

Certain variants of W138 (F, Y and A) were made as double mutants, with M197T (made as per the disclosure of Example 3). The double mutants were made following the methods described in Examples 1 and 3. Generally,single negative strands of DNA were prepared from an M13MP18 clone of the 1.72kb coding sequence (Pst l-Sst l) of the B. licheniformis alph-amylase M197T mutant. Site-directed mutagenesis was done using the primers listed below, essentially by the method of Zoller, M. et al. (1983) except T4 gene 32 protein and T4 polymerase were substituted for klenow. The primers all contained unique sites, as well as the desired mutation, in order to identify those clones with the appropriate mutation. ##STR11##

Mutants were identified by restriction analysis and W138F and W138Y confirmed by DNA sequencing. The W138A sequence revealed a nucleotide deletion between the unique BspE l and Xma l sites, however, the rest of the gene sequenced correctly. The 1.37kb Sstll/Sstl fragment containing both W138X and M197T mutations was moved from M13MP18 into the expression vector pBLapr resulting in pBLapr (W138F, M197T) and pBLapr (WI38Y, M197T). The fragment containing unique BspE l and Xma l sites was cloned into pBLapr (BspE l, Xma l, M197T) since it is useful for cloning cassettes containing other amino acid substitutions at position 138.

Single Mutations at Amino Acid Position 138

Following the general methods described in the prior examples, certain single variants of W138 (F, Y, L, H and C) were made.

The 1.24kb Asp718-Sstl fragment containing the M197T mutation in plasmid pBLapr (W138X, M197T) of Example 7 was replaced by the wild-type fragment with methionine at 197, resulting in pBLapr (W138F), pBLapr (W138Y) and pBLapr (BspE l, Xma l).

The mutants W138L, W138H and W138C were made by ligating synthetic cassettes into the pBLapr (BspE l, Xma l) vector using the following primers:

    __________________________________________________________________________     Tryptophan 138 to Leucine                                                                   Seq ID No 45                                                      CC GGA GAA CAC CTA ATT AAA GCC CTA ACA CAT TTT CAT TTT C                       Tryptophan 138 to Histidine                                                                 Seq ID No 46                                                      CC GGA GAA CAC CTA ATT AAA GCC CAC ACA CAT TTT CAT TTT C                       Tryptophan 138 to Cysteine                                                                  Seq ID No 47                                                      CC GGA GAA CAC CTA ATT AAA GCC TGC ACA CAT TTT CAT TTT C                       __________________________________________________________________________

Reaction of the double mutants M197T/W138F and M197T/W138Y with chloramine-T was compared with wild-type (AA20=0.75 mg/ml, M197T/W138F=0.64 mg/ml, M197T/W138Y =0.60 mg/ml; 50 mM NaAcetate at pH 5.0). The results shown in FIG. 19 show that mutagenesis of tryptophan 138 has caused the variant to be more resistant to chloramine-T.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 68                                                  (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 56 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        GATCAAAACATAAAAAACCGGCCTTGGCCCCGCCGGTTTTTTATTATTTTTGAGCT56                     (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 29 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        TGGGACGCTGGCGCAGTACTTTGAATGGT29                                                (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 34 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        TGATGCAGTACTTTGAATGGTACCTGCCCAATGA34                                           (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 36 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        GATTATTTGTTGTATGCCGATATCGACTATGACCAT36                                         (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 26 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        CGGGGAAGGAGGCCTTTACGGTAGCT26                                                   (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        GCGGCTATGACTTAAGGAAATTGC24                                                     (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 23 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        CTACGGGGATGCATACGGGACGA23                                                      (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 35 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        CTACGGGGATTACTACGGGACCAAGGGAGACTCCC35                                          (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 36 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        CCGGTGGGGCCAAGCGGGCCTATGTTGGCCGGCAAA36                                         (2) INFORMATION FOR SEQ ID NO:10:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 45 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                       CATCAGCGTCCCATTAAGATTTGCAGCCTGCGCAGACATGTTGCT45                                (2) INFORMATION FOR SEQ ID NO:11:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 36 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                       GATTATTTGGCGTATGCCGATATCGACTATGACCAT36                                         (2) INFORMATION FOR SEQ ID NO:12:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                       GGGAAGTTTCGAATGAAAACG21                                                        (2) INFORMATION FOR SEQ ID NO:13:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 38 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                       GTCGGCATATGCATATAATCATAGTTGCCGTTTTCATT38                                       (2) INFORMATION FOR SEQ ID NO:14:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                       CGAATGAAAACGGCAACTATGATTATTTGATCTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:15:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                       CGAATGAAAACGGCAACTATGATTATTTGTTCTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:16:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                       CGAATGAAAACGGCAACTATGATTATTTGGTTTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:17:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                       CGAATGAAAACGGCAACTATGATTATTTGAGCTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:18:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                       CGAATGAAAACGGCAACTATGATTATTTGCCTTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:19:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                       CGAATGAAAACGGCAACTATGATTATTTGACATATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:20:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                       CGAATGAAAACGGCAACTATGATTATTTGTACTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:21:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                       CGAATGAAAACGGCAACTATGATTATTTGCACTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:22:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                       CGAATGAAAACGGCAACTATGATTATTTGGGCTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:23:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                       CGAATGAAAACGGCAACTATGATTATTTGCAATATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:24:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                       CGAATGAAAACGGCAACTATGATTATTTGAACTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:25:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                       GCAATGAAAACGGCAACTATGATTATTTGAAATATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:26:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                       CGAATGAAAACGGCAACTATGATTATTTGGATTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:27:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                                       CGAATGAAAACGGCAACTATGATTATTTGGAATATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:28:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                                       CGAATGAAAACGGCAACTATGATTATTTGTGTATTGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:29:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                                       CGAATGAAAACGGCAACTATGATTATTTGTGGTATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:30:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 41 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                                       CGAATGAAAACGGCAACTATGATTATTTGAGATATGCCGAC41                                    (2) INFORMATION FOR SEQ ID NO:31:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1968 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:                                       AGCTTGAAGAAGTGAAGAAGCAGAGAGGCTATTGAATAAATGAGTAGAAAGCGCCATATC60                 GGCGCTTTTCTTTTGGAAGAAAATATAGGGAAAATGGTACTTGTTAAAAATTCGGAATAT120                TTATACAACATCATATGTTTCACATTGAAAGGGGAGGAGAATCATGAAACAACAAAAACG180                GCTTTACGCCCGATTGCTGACGCTGTTATTTGCGCTCATCTTCTTGCTGCCTCATTCTGC240                AGCAGCGGCGGCAAATCTTAATGGGACGCTGATGCAGTATTTTGAATGGTACATGCCCAA300                TGACGGCCAACATTGGAAGCGTTTGCAAAACGACTCGGCATATTTGGCTGAACACGGTAT360                TACTGCCGTCTGGATTCCCCCGGCATATAAGGGAACGAGCCAAGCGGATGTGGGCTACGG420                TGCTTACGACCTTTATGATTTAGGGGAGTTTCATCAAAAAGGGACGGTTCGGACAAAGTA480                CGGCACAAAAGGAGAGCTGCAATCTGCGATCAAAAGTCTTCATTCCCGCGACATTAACGT540                TTACGGGGATGTGGTCATCAACCACAAAGGCGGCGCTGATGCGACCGAAGATGTAACCGC600                GGTTGAAGTCGATCCCGCTGACCGCAACCGCGTAATTTCAGGAGAACACCTAATTAAAGC660                CTGGACACATTTTCATTTTCCGGGGCGCGGCAGCACATACAGCGATTTTAAATGGCATTG720                GTACCATTTTGACGGAACCGATTGGGACGAGTCCCGAAAGCTGAACCGCATCTATAAGTT780                TCAAGGAAAGGCTTGGGATTGGGAAGTTTCCAATGAAAACGGCAACTATGATTATTTGAT840                GTATGCCGACATCGATTATGACCATCCTGATGTCGCAGCAGAAATTAAGAGATGGGGCAC900                TTGGTATGCCAATGAACTGCAATTGGACGGTTTCCGTCTTGATGCTGTCAAACACATTAA960                ATTTTCTTTTTTGCGGGATTGGGTTAATCATGTCAGGGAAAAAACGGGGAAGGAAATGTT1020               TACGGTAGCTGAATATTGGCAGAATGACTTGGGCGCGCTGGAAAACTATTTGAACAAAAC1080               AAATTTTAATCATTCAGTGTTTGACGTGCCGCTTCATTATCAGTTCCATGCTGCATCGAC1140               ACAGGGAGGCGGCTATGATATGAGGAAATTGCTGAACGGTACGGTCGTTTCCAAGCATCC1200               GTTGAAATCGGTTACATTTGTCGATAACCATGATACACAGCCGGGGCAATCGCTTGAGTC1260               GACTGTCCAAACATGGTTTAAGCCGCTTGCTTACGCTTTTATTCTCACAAGGGAATCTGG1320               ATACCCTCAGGTTTTCTACGGGGATATGTACGGGACGAAAGGAGACTCCCAGCGCGAAAT1380               TCCTGCCTTGAAACACAAAATTGAACCGATCTTAAAAGCGAGAAAACAGTATGCGTACGG1440               AGCACAGCATGATTATTTCGACCACCATGACATTGTCGGCTGGACAAGGGAAGGCGACAG1500               CTCGGTTGCAAATTCAGGTTTGGCGGCATTAATAACAGACGGACCCGGTGGGGCAAAGCG1560               AATGTATGTCGGCCGGCAAAACGCCGGTGAGACATGGCATGACATTACCGGAAACCGTTC1620               GGAGCCGGTTGTCATCAATTCGGAAGGCTGGGGAGAGTTTCACGTAAACGGCGGGTCGGT1680               TTCAATTTATGTTCAAAGATAGAAGAGCAGAGAGGACGGATTTCCTGAAGGAAATCCGTT1740               TTTTTATTTTGCCCGTCTTATAAATTTCTTTGATTACATTTTATAATTAATTTTAACAAA1800               GTGTCATCAGCCCTCAGGAAGGACTTGCTGACAGTTTGAATCGCATAGGTAAGGCGGGGA1860               TGAAATGGCAACGTTATCTGATGTAGCAAAGAAAGCAAATGTGTCGAAAATGACGGTATC1920               GCGGGTGATCAATCATCCTGAGACTGTGACGGATGAATTGAAAAAGCT1968                           (2) INFORMATION FOR SEQ ID NO:32:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 483 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                                       AlaAsnLeuAsnGlyThrLeuMetGlnTyrPheGluTrpTyrMetPro                               151015                                                                         AsnAspGlyGlnHisTrpLysArgLeuGlnAsnAspSerAlaTyrLeu                               202530                                                                         AlaGluHisGlyIleThrAlaValTrpIleProProAlaTyrLysGly                               354045                                                                         ThrSerGlnAlaAspValGlyTyrGlyAlaTyrAspLeuTyrAspLeu                               505560                                                                         GlyGluPheHisGlnLysGlyThrValArgThrLysTyrGlyThrLys                               65707580                                                                       GlyGluLeuGlnSerAlaIleLysSerLeuHisSerArgAspIleAsn                               859095                                                                         ValTyrGlyAspValValIleAsnHisLysGlyGlyAlaAspAlaThr                               100105110                                                                      GluAspValThrAlaValGluValAspProAlaAspArgAsnArgVal                               115120125                                                                      IleSerGlyGluHisLeuIleLysAlaTrpThrHisPheHisPhePro                               130135140                                                                      GlyArgGlySerThrTyrSerAspPheLysTrpHisTrpTyrHisPhe                               145150155160                                                                   AspGlyThrAspTrpAspGluSerArgLysLeuAsnArgIleTyrLys                               165170175                                                                      PheGlnGlyLysAlaTrpAspTrpGluValSerAsnGluAsnGlyAsn                               180185190                                                                      TyrAspTyrLeuMetTyrAlaAspIleAspTyrAspHisProAspVal                               195200205                                                                      AlaAlaGluIleLysArgTrpGlyThrTrpTyrAlaAsnGluLeuGln                               210215220                                                                      LeuAspGlyPheArgLeuAspAlaValLysHisIleLysPheSerPhe                               225230235240                                                                   LeuArgAspTrpValAsnHisValArgGluLysThrGlyLysGluMet                               245250255                                                                      PheThrValAlaGluTyrTrpGlnAsnAspLeuGlyAlaLeuGluAsn                               260265270                                                                      TyrLeuAsnLysThrAsnPheAsnHisSerValPheAspValProLeu                               275280285                                                                      HisTyrGlnPheHisAlaAlaSerThrGlnGlyGlyGlyTyrAspMet                               290295300                                                                      ArgLysLeuLeuAsnGlyThrValValSerLysHisProLeuLysSer                               305310315320                                                                   ValThrPheValAspAsnHisAspThrGlnProGlyGlnSerLeuGlu                               325330335                                                                      SerThrValGlnThrTrpPheLysProLeuAlaTyrAlaPheIleLeu                               340345350                                                                      ThrArgGluSerGlyTyrProGlnValPheTyrGlyAspMetTyrGly                               355360365                                                                      ThrLysGlyAspSerGlnArgGluIleProAlaLeuLysHisLysIle                               370375380                                                                      GluProIleLeuLysAlaArgLysGlnTyrAlaTyrGlyAlaGlnHis                               385390395400                                                                   AspTyrPheAspHisHisAspIleValGlyTrpThrArgGluGlyAsp                               405410415                                                                      SerSerValAlaAsnSerGlyLeuAlaAlaLeuIleThrAspGlyPro                               420425430                                                                      GlyGlyAlaLysArgMetTyrValGlyArgGlnAsnAlaGlyGluThr                               435440445                                                                      TrpHisAspIleThrGlyAsnArgSerGluProValValIleAsnSer                               450455460                                                                      GluGlyTrpGlyGluPheHisValAsnGlyGlySerValSerIleTyr                               465470475480                                                                   ValGlnArg                                                                      (2) INFORMATION FOR SEQ ID NO:33:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 511 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:                                       MetLysGlnGlnLysArgLeuTyrAlaArgLeuLeuThrLeuLeuPhe                               151015                                                                         AlaLeuIlePheLeuLeuProHisSerAlaAlaAlaAlaAlaAsnLeu                               202530                                                                         AsnGlyThrLeuMetGlnTyrPheGluTrpTyrMetProAsnAspGly                               354045                                                                         HisTrpLysArgLeuGlnAsnAspSerAlaTyrLeuAlaGluHisGly                               505560                                                                         IleThrAlaValTrpIleProProAlaTyrLysGlyThrSerGlnAla                               65707580                                                                       AspValGlyTyrGlyAlaTyrAspLeuTyrAspLeuGlyGluPheHis                               859095                                                                         GlnLysGlyThrValArgThrLysTyrGlyThrLysGlyGluLeuGln                               100105110                                                                      SerAlaIleLysSerLeuHisSerArgAspIleAsnValTyrGlyAsp                               115120125                                                                      ValValIleAsnHisLysGlyGlyAlaAspAlaThrGluAspValThr                               130135140                                                                      AlaValGluValAspProAlaAspArgAsnArgValIleSerGlyGlu                               145150155160                                                                   HisLeuIleLysAlaTrpThrHisPheHisPheProGlyArgGlySer                               165170175                                                                      ThrTyrSerAspPheLysTrpHisTrpTyrHisPheAspGlyThrAsp                               180185190                                                                      TrpAspGluSerArgLysLeuAsnArgIleTyrLysPheGlnGlyLys                               195200205                                                                      AlaTrpAspTrpGluValSerAsnGluAsnGlyAsnTyrAspTyrLeu                               210215220                                                                      MetTyrAlaAspIleAspTyrAspHisProAspValAlaAlaGluIle                               225230235240                                                                   LysArgTrpGlyThrTrpTyrAlaAsnGluLeuGlnLeuAspGlyPhe                               245250255                                                                      ArgLeuAspAlaValLysHisIleLysPheSerPheLeuArgAspTrp                               260265270                                                                      ValAsnHisValArgGluLysThrGlyLysGluMetPheThrValAla                               275280285                                                                      GluTyrTrpGlnAsnAspLeuGlyAlaLeuGluAsnTyrLeuAsnLys                               290295300                                                                      ThrAsnPheAsnHisSerValPheAspValProLeuHisTyrGlnPhe                               305310315320                                                                   HisAlaAlaSerThrGlnGlyGlyGlyTyrAspMetArgLysLeuLeu                               325330335                                                                      AsnGlyThrValValSerLysHisProLeuLysSerValThrPheVal                               340345350                                                                      AspAsnHisAspThrGlnProGlyGlnSerLeuGluSerThrValGln                               355360365                                                                      ThrTrpPheLysProLeuAlaTyrAlaPheIleLeuThrArgGluSer                               370375380                                                                      GlyTyrProGlnValPheTyrGlyAspMetTyrGlyThrLysGlyAsp                               385390395400                                                                   SerGlnArgGluIleProAlaLeuLysHisLysIleGluProIleLeu                               405410415                                                                      LysAlaArgLysGlnTyrAlaTyrGlyAlaGlnHisAspTyrPheAsp                               420425430                                                                      HisHisAspIleValGlyTrpThrArgGluGlyAspSerSerValAla                               435440445                                                                      AsnSerGlyLeuAlaAlaLeuIleThrAspGlyProGlyGlyAlaLys                               450455460                                                                      ArgMetTyrValGlyArgGlnAsnAlaGlyGluThrTrpHisAspIle                               465470475480                                                                   ThrGlyAsnArgSerGluProValValIleAsnSerGluGlyTrpGly                               485490495                                                                      GluPheHisValAsnGlyGlySerValSerIleTyrValGlnArg                                  500505510                                                                      (2) INFORMATION FOR SEQ ID NO:34:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 520 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:                                       MetArgGlyArgGlyAsnMetIleGlnLysArgLysArgThrValSer                               151015                                                                         PheArgLeuValLeuMetCysThrLeuLeuPheValSerLeuProIle                               202530                                                                         ThrLysThrSerAlaValAsnGlyThrLeuMetGlnTyrPheGluTrp                               354045                                                                         TyrThrProAsnAspGlyGlnHisTrpLysArgLeuGlnAsnAspAla                               505560                                                                         GluHisLeuSerAspIleGlyIleThrAlaValTrpIleProProAla                               65707580                                                                       TyrLysGlyLeuSerGlnSerAspAsnGlyTyrGlyProTyrAspLeu                               859095                                                                         TyrAspLeuGlyGluPheGlnGlnLysGlyThrValArgThrLysTyr                               100105110                                                                      GlyThrLysSerGluLeuGlnAspAlaIleGlySerLeuHisSerArg                               115120125                                                                      AsnValGlnValTyrGlyAspValValLeuAsnHisLysAlaGlyAla                               130135140                                                                      AspAlaThrGluAspValThrAlaValGluValAsnProAlaAsnArg                               145150155160                                                                   AsnGlnGluThrSerGluGluTyrGlnIleLysAlaTrpThrAspPhe                               165170175                                                                      ArgPheProGlyArgGlyAsnThrTyrSerAspPheLysTrpHisTrp                               180185190                                                                      TyrHisPheAspGlyAlaAspTrpAspGluSerArgLysIleSerArg                               195200205                                                                      IlePheLysPheArgGlyGluGlyLysAlaTrpAspTrpGluValSer                               210215220                                                                      SerGluAsnGlyAsnTyrAspTyrLeuMetTyrAlaAspValAspTyr                               225230235240                                                                   AspHisProAspValValAlaGluThrLysLysTrpGlyIleTrpTyr                               245250255                                                                      AlaAsnGluLeuSerLeuAspGlyPheArgIleAspAlaAlaLysHis                               260265270                                                                      IleLysPheSerPheLeuArgAspTrpValGlnAlaValArgGlnAla                               275280285                                                                      ThrGlyLysGluMetPheThrValAlaGluTyrTrpGlnAsnAsnAla                               290295300                                                                      GlyLysLeuGluAsnTyrLeuAsnLysThrSerPheAsnGlnSerVal                               305310315320                                                                   PheAspValProLeuHisPheAsnLeuGlnAlaAlaSerSerGlnGly                               325330335                                                                      GlyGlyTyrAspMetArgArgLeuLeuAspGlyThrValValSerArg                               340345350                                                                      HisProGluLysAlaValThrPheValGluAsnHisAspThrGlnPro                               355360365                                                                      GlyGlnSerLeuGluSerThrValGlnThrTrpPheLysProLeuAla                               370375380                                                                      TyrAlaPheIleLeuThrArgGluSerGlyTyrProGlnValPheTyr                               385390395400                                                                   GlyAspMetTyrGlyThrLysGlyThrSerProLysGluIleProSer                               405410415                                                                      LeuLysAspAsnIleGluProIleLeuLysAlaArgLysGluTyrAla                               420425430                                                                      TyrGlyProGlnHisAspTyrIleAspHisProAspValIleGlyTrp                               435440445                                                                      ThrArgGluGlyAspSerSerAlaAlaLysSerGlyLeuAlaAlaLeu                               450455460                                                                      IleThrAspGlyProGlyGlySerLysArgMetTyrAlaGlyLeuLys                               465470475480                                                                   AsnAlaGlyGluThrTrpTyrAspIleThrGlyAsnArgSerAspThr                               485490495                                                                      ValLysIleGlySerAspGlyTrpGlyGluPheHisValAsnAspGly                               500505510                                                                      SerValSerIleTyrValGlnLys                                                       515520                                                                         (2) INFORMATION FOR SEQ ID NO:35:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 548 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:                                       ValLeuThrPheHisArgIleIleArgLysGlyTrpMetPheLeuLeu                               151015                                                                         AlaPheLeuLeuThrAlaSerLeuPheCysProThrGlyArgHisAla                               202530                                                                         LysAlaAlaAlaProPheAsnGlyThrMetMetGlnTyrPheGluTrp                               354045                                                                         TyrLeuProAspAspGlyThrLeuTrpThrLysValAlaAsnGluAla                               505560                                                                         AsnAsnLeuSerSerLeuGlyIleThrAlaLeuSerLeuProProAla                               65707580                                                                       TyrLysGlyThrSerArgSerAspValGlyTyrGlyValTyrAspLeu                               859095                                                                         TyrAspLeuGlyGluPheAsnGlnLysGlyThrValArgThrLysTyr                               100105110                                                                      GlyThrLysAlaGlnTyrLeuGlnAlaIleGlnAlaAlaHisAlaAla                               115120125                                                                      GlyMetGlnValTyrAlaAspValValPheAspHisLysGlyGlyAla                               130135140                                                                      AspGlyThrGluTrpValAspAlaValGluValAsnProSerAspArg                               145150155160                                                                   AsnGlnGluIleSerGlyThrTyrGlnIleGlnAlaTrpThrLysPhe                               165170175                                                                      AspPheProGlyArgGlyAsnThrTyrSerSerPheLysTrpArgTrp                               180185190                                                                      TyrHisPheAspGlyValAspTrpAspGluSerArgLysLeuSerArg                               195200205                                                                      IleTyrLysPheArgGlyIleGlyLysAlaTrpAspTrpGluValAsp                               210215220                                                                      ThrGluAsnGlyAsnTyrAspTyrLeuMetTyrAlaAspLeuAspMet                               225230235240                                                                   AspHisProGluValValThrGluLeuLysAsnTrpGlyLysTrpTyr                               245250255                                                                      ValAsnThrThrAsnIleAspGlyPheArgLeuAspGlyLeuLysHis                               260265270                                                                      IleLysPheSerPhePheProAspTrpLeuSerTyrValArgSerGln                               275280285                                                                      ThrGlyLysProLeuPheThrValGlyGluTyrTrpSerTyrAspIle                               290295300                                                                      AsnLysLeuHisAsnTyrIleThrLysThrAsnGlyThrMetSerLeu                               305310315320                                                                   PheAspAlaProLeuHisAsnLysPheTyrThrAlaSerLysSerGly                               325330335                                                                      GlyAlaPheAspMetArgThrLeuMetThrAsnThrLeuMetLysAsp                               340345350                                                                      GlnProThrLeuAlaValThrPheValAspAsnHisAspThrAsnPro                               355360365                                                                      AlaLysArgCysSerHisGlyArgProTrpPheLysProLeuAlaTyr                               370375380                                                                      AlaPheIleLeuThrArgGlnGluGlyTyrProCysValPheTyrGly                               385390395400                                                                   AspTyrTyrGlyIleProGlnTyrAsnIleProSerLeuLysSerLys                               405410415                                                                      IleAspProLeuLeuIleAlaArgArgAspTyrAlaTyrGlyThrGln                               420425430                                                                      HisAspTyrLeuAspHisSerAspIleIleGlyTrpThrArgGluGly                               435440445                                                                      ValThrGluLysProGlySerGlyLeuAlaAlaLeuIleThrAspGly                               450455460                                                                      AlaGlyArgSerLysTrpMetTyrValGlyLysGlnHisAlaGlyLys                               465470475480                                                                   ValPheTyrAspLeuThrGlyAsnArgSerAspThrValThrIleAsn                               485490495                                                                      SerAspGlyTrpGlyGluPheLysValAsnGlyGlySerValSerVal                               500505510                                                                      TrpValProArgLysThrThrValSerThrIleAlaArgProIleThr                               515520525                                                                      ThrArgProTrpThrGlyGluPheValArgTrpHisGluProArgLeu                               530535540                                                                      ValAlaTrpPro                                                                   545                                                                            (2) INFORMATION FOR SEQ ID NO:36:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 483 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:                                       AlaAsnLeuAsnGlyThrLeuMetGlnTyrPheGluTrpTyrMetPro                               151015                                                                         AsnAspGlyGlnHisTrpLysArgLeuGlnAsnAspSerAlaTyrLeu                               202530                                                                         AlaGluHisGlyIleThrAlaValTrpIleProProAlaTyrLysGly                               354045                                                                         ThrSerGlnAlaAspValGlyTyrGlyAlaTyrAspLeuTyrAspLeu                               505560                                                                         GlyGluPheHisGlnLysGlyThrValArgThrLysTyrGlyThrLys                               65707580                                                                       GlyGluLeuGlnSerAlaIleLysSerLeuHisSerArgAspIleAsn                               859095                                                                         ValTyrGlyAspValValIleAsnHisLysGlyGlyAlaAspAlaThr                               100105110                                                                      GluAspValThrAlaValGluValAspProAlaAspArgAsnArgVal                               115120125                                                                      IleSerGlyGluHisLeuIleLysAlaTrpThrHisPheHisPhePro                               130135140                                                                      GlyArgGlySerThrTyrSerAspPheLysTrpHisTrpTyrHisPhe                               145150155160                                                                   AspGlyThrAspTrpAspGluSerArgLysLeuAsnArgIleTyrLys                               165170175                                                                      PheGlnGlyLysAlaTrpAspTrpGluValSerAsnGluAsnGlyAsn                               180185190                                                                      TyrAspTyrLeuThrTyrAlaAspIleAspTyrAspHisProAspVal                               195200205                                                                      AlaAlaGluIleLysArgTrpGlyThrTrpTyrAlaAsnGluLeuGln                               210215220                                                                      LeuAspGlyPheArgLeuAspAlaValLysHisIleLysPheSerPhe                               225230235240                                                                   LeuArgAspTrpValAsnHisValArgGluLysThrGlyLysGluMet                               245250255                                                                      PheThrValAlaGluTyrTrpGlnAsnAspLeuGlyAlaLeuGluAsn                               260265270                                                                      TyrLeuAsnLysThrAsnPheAsnHisSerValPheAspValProLeu                               275280285                                                                      HisTyrGlnPheHisAlaAlaSerThrGlnGlyGlyGlyTyrAspMet                               290295300                                                                      ArgLysLeuLeuAsnGlyThrValValSerLysHisProLeuLysSer                               305310315320                                                                   ValThrPheValAspAsnHisAspThrGlnProGlyGlnSerLeuGlu                               325330335                                                                      SerThrValGlnThrTrpPheLysProLeuAlaTyrAlaPheIleLeu                               340345350                                                                      ThrArgGluSerGlyTyrProGlnValPheTyrGlyAspMetTyrGly                               355360365                                                                      ThrLysGlyAspSerGlnArgGluIleProAlaLeuLysHisLysIle                               370375380                                                                      GluProIleLeuLysAlaArgLysGlnTyrAlaTyrGlyAlaGlnHis                               385390395400                                                                   AspTyrPheAspHisHisAspIleValGlyTrpThrArgGluGlyAsp                               405410415                                                                      SerSerValAlaAsnSerGlyLeuAlaAlaLeuIleThrAspGlyPro                               420425430                                                                      GlyGlyAlaLysArgMetTyrValGlyArgGlnAsnAlaGlyGluThr                               435440445                                                                      TrpHisAspIleThrGlyAsnArgSerGluProValValIleAsnSer                               450455460                                                                      GluGlyTrpGlyGluPheHisValAsnGlyGlySerValSerIleTyr                               465470475480                                                                   ValGlnArg                                                                      (2) INFORMATION FOR SEQ ID NO:37:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 487 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:                                       AlaAlaAlaAlaAlaAsnLeuAsnGlyThrLeuMetGlnTyrPheGlu                               151015                                                                         TrpTyrMetProAsnAspGlyGlnHisTrpLysArgLeuGlnAsnAsp                               202530                                                                         SerAlaTyrLeuAlaGluHisGlyIleThrAlaValTrpIleProPro                               354045                                                                         AlaTyrLysGlyThrSerGlnAlaAspValGlyTyrGlyAlaTyrAsp                               505560                                                                         LeuTyrAspLeuGlyGluPheHisGlnLysGlyThrValArgThrLys                               65707580                                                                       TyrGlyThrLysGlyGluLeuGlnSerAlaIleLysSerLeuHisSer                               859095                                                                         ArgAspIleAsnValTyrGlyAspValValIleAsnHisLysGlyGly                               100105110                                                                      AlaAspAlaThrGluAspValThrAlaValGluValAspProAlaAsp                               115120125                                                                      ArgAsnArgValIleSerGlyGluHisLeuIleLysAlaTrpThrHis                               130135140                                                                      PheHisPheProGlyArgGlySerThrTyrSerAspPheLysTrpHis                               145150155160                                                                   TrpTyrHisPheAspGlyThrAspTrpAspGluSerArgLysLeuAsn                               165170175                                                                      ArgIleTyrLysPheGlnGlyLysAlaTrpAspTrpGluValSerAsn                               180185190                                                                      GluAsnGlyAsnTyrAspTyrLeuMetTyrAlaAspIleAspTyrAsp                               195200205                                                                      HisProAspValAlaAlaGluIleLysArgTrpGlyThrTrpTyrAla                               210215220                                                                      AsnGluLeuGlnLeuAspGlyPheArgLeuAspAlaValLysHisIle                               225230235240                                                                   LysPheSerPheLeuArgAspTrpValAsnHisValArgGluLysThr                               245250255                                                                      GlyLysGluMetPheThrValAlaGluTyrTrpGlnAsnAspLeuGly                               260265270                                                                      AlaLeuGluAsnTyrLeuAsnLysThrAsnPheAsnHisSerValPhe                               275280285                                                                      AspValProLeuHisTyrGlnPheHisAlaAlaSerThrGlnGlyGly                               290295300                                                                      GlyTyrAspMetArgLysLeuLeuAsnGlyThrValValSerLysHis                               305310315320                                                                   ProLeuLysSerValThrPheValAspAsnHisAspThrGlnProGly                               325330335                                                                      GlnSerLeuGluSerThrValGlnThrTrpPheLysProLeuAlaTyr                               340345350                                                                      AlaPheIleLeuThrArgGluSerGlyTyrProGlnValPheTyrGly                               355360365                                                                      AspMetTyrGlyThrLysGlyAspSerGlnArgGluIleProAlaLeu                               370375380                                                                      LysHisLysIleGluProIleLeuLysAlaArgLysGlnTyrAlaTyr                               385390395400                                                                   GlyAlaGlnHisAspTyrPheAspHisHisAspIleValGlyTrpThr                               405410415                                                                      ArgGluGlyAspSerSerValAlaAsnSerGlyLeuAlaAlaLeuIle                               420425430                                                                      ThrAspGlyProGlyGlyAlaLysArgMetTyrValGlyArgGlnAsn                               435440445                                                                      AlaGlyGluThrTrpHisAspIleThrGlyAsnArgSerGluProVal                               450455460                                                                      ValIleAsnSerGluGlyTrpGlyGluPheHisValAsnGlyGlySer                               465470475480                                                                   ValSerIleTyrValGlnArg                                                          485                                                                            (2) INFORMATION FOR SEQ ID NO:38:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 32 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:                                       MetLysGlnGlnLysArgLeuThrAlaArgLeuLeuThrLeuLeuPhe                               151015                                                                         AlaLeuIlePheLeuLeuProHisSerAlaAlaAlaAlaAlaAsnLeu                               202530                                                                         (2) INFORMATION FOR SEQ ID NO:39:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 33 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:                                       MetArgSerLysThrLeuTrpIleSerLeuLeuPheAlaLeuThrLeu                               151015                                                                         IlePheThrMetAlaPheSerAsnMetSerAlaGlnAlaAlaGlyLys                               202530                                                                         Ser                                                                            (2) INFORMATION FOR SEQ ID NO:40:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 35 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:                                       MetArgSerLysThrLeuTrpIleSerLeuLeuPheAlaLeuThrLeu                               151015                                                                         IlePheThrMetAlaPheSerAsnMetSerAlaGlnAlaAlaAlaAla                               202530                                                                         AlaAlaAsn                                                                      35                                                                             (2) INFORMATION FOR SEQ ID NO:41:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 32 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:                                       MetArgSerLysThrLeuTrpIleSerLeuLeuPheAlaLeuThrLeu                               151015                                                                         IlePheThrMetAlaPheSerAsnMetSerAlaGlnAlaAlaAsnLeu                               202530                                                                         (2) INFORMATION FOR SEQ ID NO:42:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 33 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:                                       CACCTAATTAAAGCTTTCACACATTTTCATTTT33                                            (2) INFORMATION FOR SEQ ID NO:43:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 33 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:                                       CACCTAATTAAAGCTTACACACATTTTCATTTT33                                            (2) INFORMATION FOR SEQ ID NO:44:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 66 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:                                       CCGCGTAATTTCCGGAGAACACCTAATTAAAGCCGCAACACATTTTCATTTTCCCGGGCG60                 CGGCAG66                                                                       (2) INFORMATION FOR SEQ ID NO:45:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 42 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:                                       CCGGAGAACACCTAATTAAAGCCCTAACACATTTTCATTTTC42                                   (2) INFORMATION FOR SEQ ID NO:46:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 42 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:                                       CCGGAGAACACCTAATTAAAGCCCACACACATTTTCATTTTC42                                   (2) INFORMATION FOR SEQ ID NO:47:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 42 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:                                       CCGGAGAACACCTAATTAAAGCCTGCACACATTTTCATTTTC42                                   (2) INFORMATION FOR SEQ ID NO:48:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:                                       GATGCAGTATTTCGAACTGGTATA24                                                     (2) INFORMATION FOR SEQ ID NO:49:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 26 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:                                       TGCCCAATGATGGCCAACATTGGAAG26                                                   (2) INFORMATION FOR SEQ ID NO:50:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:                                       CGAATGGTATGCTCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:51:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:                                       CGAATGGTATCGCCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:52:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:                                       CGAATGGTATAATCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:53:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:                                       CGAATGGTATGATCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:54:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:                                       CGAATGGTATCACCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:55:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:                                       CGAATGGTATAAACCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:56:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:                                       CGAATGGTATCCGCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:57:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:                                       CGAATGGTATTCTCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:58:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:                                       CGAATGGTACACTCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:59:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:                                       CGAATGGTATGTTCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:60:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:                                       CGAATGGTATTGTCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:61:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:                                       CGAATGGTATCAACCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:62:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:                                       CGAATGGTATGAACCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:63:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:                                       CGAATGGTATGGTCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:64:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:                                       CGAATGGTATATTCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:65:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:                                       CGAATGGTATTTTCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:66:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:                                       CGAATGGTACTGGCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:67:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:                                       CGAATGGTATTATCCCAATGACGG24                                                     (2) INFORMATION FOR SEQ ID NO:68:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:                                       CCGTCATTGGGACTACGTACCATT24                                                     __________________________________________________________________________ 

What is claimed is:
 1. DNA encoding a mutant alpha-amylase, the mutant alpha-amylase derived from Bacillus and comprising:(a) a substitution of an amino acid selected from the group consisting of threonine, leucine, alanine, arginine, asparagine, aspartic acid, glutamic acid, glycine, isoleucine, lysine, phenylalanine, proline, serine, valine, histidine and glutamine or a deletion of a methionine residue at a position equivalent to M197 in Bacillus licheniformis alpha-amylase; and/or (b) a substitution of an amino acid selected from the group consisting of leucine, threonine, asparagine, aspartate, serine, valine, and isoleucine for a methionine residue at a position equivalent to M15 in Bacillus licheniformis alpha amylase.
 2. Expression vectors encoding the DNA of claim
 1. 3. Host cells transformed with the expression vector of claim
 2. 4. A method of producing a mutant alpha-amylase with the host cell of claim 3 comprising the steps of:(a) preparing a culture medium useful for the growth and cultivation of said host cell; (b) cultivating said host cell under appropriate conditions to induce production of said alpha-amylase by said host cell; and (c) optionally separating said alpha-amylase from said host cells.
 5. The DNA according to claim 1, wherein said alpha amylase further comprises a substitution at an amino acid equivalent to W138 in Bacillus licheniformis alpha-amylase.
 6. DNA encoding an alpha amylase comprising an amino acid sequence corresponding to Seq. ID No.
 37. 7. Expression vectors encoding the DNA of claim
 6. 8. Host cells transformed with the expression vector of claim
 7. 9. DNA encoding a mutant alpha amylase, wherein said alpha-amylase has an amino acid sequence of SEQ ID NO:37 with a deletion or substitution of amino acid residue M15, M197 and/or W138.
 10. Expression vectors encoding the DNA of claim
 9. 11. Host cells transformed with the expression vector of claim
 10. 